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Remote health monitoring frameworks gained significant attention due to 
their real intervention and treatment standards. The proposed work intends to 
design an artificial intelligence (AI) based remote health monitoring 
framework for predicting heart disease and diabetes from the given medical 
datasets. In this framework, the smart devices are used to gather the health 
information of patients, and the obtained information is integrated together by 
using different nodes that includes the detecting node, visualization node, and 
prognostic node. Then, at that point, the health care dataset preprocessing is 
performed to standardize the characteristics by recognizing the missing 
qualities and taking out the unessential characteristics. Consequently, the 
unified levy modeled crow search optimization (ULMCSO) algorithm is 
employed to select the optimal features based on the global fitness function, 
which helps increase the accuracy and reduce the training time of the 
classifier. Finally, the probabilistic guided naive distribution (PGND) based 


classification model is utilized for predicting the label as to whether normal 
or disease affected. During an evaluation, two different datasets, such as 
PIMA and Hungarian, are used to validate and compare the results of the 
proposed model by using various performance measures. 
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1. INTRODUCTION 

Public health monitoring is one of the most essential and fundamental concerns that need to be focused 
on to prevent people from health hazards [1]. In current days, most people are highly affected by heart disease 
and diabetes, which are considered life-threatening diseases because it affects the other parts of the nerve 
system and kidney. Due to the low-quality treatments and inefficient clinical diagnosis, most healthcare 
systems [2], [3] could use the decision making systems for proper disease prediction and diagnosis. Because 
earlier disease identification helps to prevent people from harmful diseases. Recently, developing an automated 
remote monitoring system is more vital for observing the healthcare status [4], [5] of patients, which is 
accomplished by using health devices, smartwatches, and smart mobile phones. The primary purpose of using 
the remote monitoring system is to provide proper treatments to the patients during emergency cases. For this 
purpose, an intelligent machine learning-based smart health framework is constructed in this work, which helps 
to consolidate the patient data [6] from the smart devices for early detection of risks and keep the healthcare 
professionals updated with the present health information patients. In this framework, the detecting node, 
visualization node, and prognostic node have been used to gather the health information from patients for 
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generating the alert to the healthcare professionals [7]. This pipeline can be implemented with the modules of 

data imputation, optimization and classification. Here, the data imputation is performed for scaling the data 

obtained from the medical datasets, and it helps to improve the quality of patterns. After that, the feature 

selection can be done by using the novel optimization technique to reduce the dimensionality of data to improve 

the prediction rate. Finally, an advanced artificial intelligence (AI) based machine learning technique can be 

implemented for accurately predicting the given data into normal or abnormal classes. The main contributions 

of this paper are as follows: 

— To obtain the health information of patients from the smart devices, three different nodes such as detecting 
node, visualization node, and prognostic node have been utilized in this framework. 

— To fine-tune the given input datasets by eliminating the noise and normalizing the attributes, the dataset 
preprocessing is performed initially. 

— A unified levy modeled crow search optimization (ULMCSO) based feature selection algorithm is 
employed to select the features based on the global fitness value with reduced iterations. 

— The probabilistic guided naive distribution (PGND) classification algorithm is deployed to predict the 
classified output as normal accurately or disease affected. 

— To validate the efficiency and results of these approaches, the two different and most popular datasets, 
such as Hungarian and PIMA, are utilized in current system. 

The overflow bits of this paper are structuralized into the going with pieces: Section 2 takes apart the 
continuous social occasion, movement, and solicitation perspectives utilized for somewhat investigating the 
success status of patients. Section 3 presents the working systems for the proposed work with its general stream 
and estimations. Section 4 assesses the presentation and near examination of existing and proposed procedures 
utilizing different measures. At last, the general paper is summed up with its future work in section 5. 


2. RELATED WORKS 

This segment surveys a portion of the regular works connected with information bunching, 
enhancement, AI, and profound learning methods used to foster a far off medical care checking framework for 
nursing patients with various kinds of infections. Moreover, it analyzes the benefits and obstacles of the 
ongoing procedures as shown by their basic features and used norms. Gondalia et al. [8] introduced an 
automated internet of things (IoT) based health monitoring system by using the machine learning technique for 
observing the health status of war soldiers. The main intention of this paper was to accurately identify the 
location of soldiers for monitoring their health status, who injured in the battlefield. Here, the health of soldiers 
were monitored by using the GPS control, and sensors like (heartbeat and temperature). Moreover, the k-means 
clustering algorithm was utilized in this work for analyzing the information obtained from the sensors. 
According to the sensor inputs, the different types of actions like sitting, walking or running in case of wound 
or blasting have been accurately predicted by clustering the attributes. However, the k-means clustering was 
not more suitable for this kind of real time applications due to its reduced efficiency, global clustering, and 
varying initial partitions. Malasinghe et al. [9] presented a comprehensive survey for the selecting the suitable 
technique to develop a remote patient health monitoring system. Also, the different types of security issues 
associated to the e-health systems were discussed with its appropriate solutions. Ramkumar et al. [10] designed 
a new remote patient monitoring framework using animproved machine learning technique. The pilot data has 
been utilized in this system for analysis, where the wearable technology could be used to monitor the health 
status of patients. Chatrati et al. [11] encouraged a canny home prosperity really taking a look at framework 
for affirmation the diabetes and circulatory strain patients from a good way. 

The principal reason for this work was to dissect the glucose readings of the patients at their home for 
giving the medical care office on the off chance that anomaly. Moreover, the circulatory strain was likewise 
identified by utilizing the mix of direction and AI models. Here, the support vector machine (SVM) request 
strategy was utilized to arrange the diabetic and non-diabetic patients considering the data planning set. 
Moreover, the model optimization algorithm was utilized to optimally select the features for improving the 
accuracy of classification, which also helps to reduce the overfitting. Still, this work requires to develop the 
proper graphical user interface for automatically transferring the medical status of patients to the doctors at the 
time. Vitabile et al. [12] utilized a smart health monitoring system for analyzing the psychological conditions 
and health status of patients based on the measures of chest sounds, temperature, blood pressure, heart rate, 
and electrocardiograms (ECG). Here, the blockchain methodology has been utilized to ensure the privacy and 
security of medical data. The primary advantage of this system was, it has better ability to handle the large 
dimensional data with reduced time delay. Nair et al. [13] employed a spark-based machine learning 
methodology for remotely predicting the health status of patients. Here, the decision tree classification 
mechanism was utilized to predict the disease based on the attributes obtained from the dataset. Also, it was 
more capable for handling the huge datasets based on the partitioning of feature models. Moreover, this 
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classification model could efficiently reduce the generalization error and overfitting value. Still, it follows some 
computational steps for predicting the classified labels, which was the key limitation of this work. 

Li et al. [14] implemented a multi-stationary approach for remotely monitoring the health status of 
older people. In this framework, two different types of sensor have been utilized to predict the disease, which 
includes radar and wearable sensors. For improving on the cycle and expanding the exactness of grouping, 
different element choice models have been used in this work, which incorporates the channel strategies, 
covering techniques, and implanted strategies. Besides, the K-nearest neighbor (KNN) and SVM request 
approaches have been used for predicting an infection, and the got consequences of these methods were looked 
at in light of the proportions of precision and handling time. According to the validation, it was analyzed that 
the SVM technique outperforms the KNN model with improved performance results. Kaur et al. [15] deployed 
a random forest classification technique for designing an efficient health monitoring system to remotely 
monitor the health status of patients. Here, the IoT based healthcare system was developed for predicting 
chronic diseases at the time of medical emergencies. Based on the types of various disease, the performance of 
classification techniques such as KNN, linear SVM, random forest, decision tree, and multilayer perceptron 
were verified and compared by using the measures of accuracy, area under curve (AUC), accuracy, and review. 
In view of the acquired outcomes [16], it was analyzed that the random forest technique outperforms the other 
techniques with increased detection accuracy. 

Verma and Sood [17] presented a haze helped IoT empowered structure for remotely checking the 
medical care information through the brilliant passages. Here, the event triggering data transmission methodology 
could be used for analyzing the patient’s health data based on the temporal health index. Azimi et al. [18] 
implemented an edge based deep learning methodology for designing a hierarchical healthcare monitoring 
framework. The primary motivation behind this work is to guarantee the expanded accessibility and exactness 
of order at the hour of ongoing checking. In this system, the medical care give could acquire the clinical 
information from the cloud server through the sensor gadgets associated in the organization. The primary 
advantage of this work was, it efficiently reduced the response time by properly monitoring the health 
information. Moshin et al. [19] presented a multilayer taxonomy for developing an efficient real time remote 
health monitoring system. Here, the wireless body area network (WBAN) telemedicine architecture model was 
developed for monitoring the patient’s health information using various body sensor devices. It includes blood 
pressure sensor, ear sensor, motion sensor, electroencephalogram (EEG), ECG and electromyogram (EMG) 
sensors, which gathers the medical information from the human body and transmits it to the healthcare 
professional through wireless technology. 

Devarajan et al. [20] constructed an energy efficient fog assisted health monitoring framework for 
remotely monitoring the diabetic patients. The main purpose of this work was to reduce the latency, complexity, 
and improve the energy efficiency of healthcare support system. Hassan et al. [21] introduced the remote pain 
monitoring architecture using an IoT incorporated Fog technology for e-healthcare system. This paper objects 
to accomplish the reduced execution cost, time delay, and resource consumption required for developing an 
efficient web application. Moreover, the different types of e-healthcare services have been validated in this 
work, which includes robotic services, audio/video communication, and monitoring application. In this 
framework, the fog node gathers the patient health information by using the actuators and sensors, and the 
obtained data were transferred to the cloud storage using the proxy server. This framework works in light of 
the early bird gets the worm planning process, subsequently it was not more appropriate for all sort of health- 
related crisis applications, which was the significant restriction of this work. Hasan et al. [22] sent a fix based 
continuous medical care observing framework utilizing an IoT innovation. The principal motivation behind 
this work was to create an exceptionally solid eHealth system for lessening both the expense and human 
exertion. Here, the low power radio recurrence innovation could be utilized for laying out the information 
transmission with diminished piece mistake rate Table 1. Contains the accuracy of the different algorithems 
used and utilization of the IoT devise. 

Here, the SVM grouping method was used to classify Based on the survey, it is examined that the 
current works are exceptionally centered around fostering a far-off medical services framework for observing 
the wellbeing status of patients utilizing the savvy gadgets like sensors, shrewd watches and other brilliant 
gadgets. Still, it faced the following limitations: 

— Inefficient data transmission between the sensors and storage components. 

— Increased time delay of processing. 

— High bit error rate and complexity in classification. 

— Reduced system performance. 

Hence, the proposed work objects to design an efficient remote health monitoring framework for tracking the 
health status of patients using an advanced AI mechanism. 
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Table 1. Comparative analysis between the existing algorithms 


References Technique Description Utilization of | Accuracy 
IoT device 
Makhadmeh and Deep belief neural network An IoT weable medical device has been Yes 99% 
Tolba [23] (DBNN) utilized in this model for data gathering. 
Mohan et al. [24] Random forest (RF) integrated It objects to predict the cardio vascular No 88.9% 
with linear model disease. 
Nalluri and Roy [25] Multi-objective optimization- A hybrid disease detection framework is No 94% 
based classification developed. 
Haq et al. [26] Intelligent machine learning A hybridized framework is developed No 89% 
model for predicting the heart disease. 
Uyar and {than [27] GA integrated with recurrent It intends to design a heart disease No 97.8% 
fuzzy NN detection framework. 
Ahmed [28] K-nearest neighbor (KNN) It developed an IoT based heart rate Yes 96% 
Classification monitoring framework. 
Nazari et al. [29] Fuzzy AHP model A likelihood-based disease detection No 95% 
system is developed. 
Vivekanandan and Differential evolution (DE) It objects to implement the machine No 83% 
Iyengar [30] integrated with fuzzy NN learning classifier for heart disease 
prediction. 
Ali et al. [31] Statistical DNN classification It intends to develop a heart disease No 93.3% 
detection system. 
Khiarak et al. [32] Meta-heuristic optimization- The cardiac disease diagnosis system is No 94% 
based classification developed. 


3. PROPOSED METHOD 

This part examines about the functioning model of proposed far off wellbeing checking framework 
utilizing progressed enhancement and arrangement approaches. The principal commitment of this paper is to 
foster a wellbeing checking system for partner the clinical data got from the patients utilizing the individual 
savvy gadgets. In light of this clinical information, the diabetes dangers and heart chances are anticipated in 
before for giving clinical updates to the patients. The proposed smart healthcare monitoring framework and its 
components are shown in Figure |. Here, the information like patient name, age, gender, height/weight, heart 
rate, blood pressure, glucose level, calories, medication level, and blood oxygen have been gathered using the 
sensor nodes. In this framework, the one-time authentication is performed for fusing the medical information 
obtained from the sensors. Then, the mobile application can directly extract the required data from the cloud 
platforms, hence the user does not required to enter their data, which is one of the key benefit of this system. 
For this purpose, the GoogleFit cloud service is utilized in this model that maintains the medical information 
of patients obtained through smart devices. Moreover, the application programming interfaces (API) has been 
utilized to establish the connection and data transmission between the end-user devices and software arbitrators. 


Gluco Smart Smart 
Meter Watch Phones 


Blood Pressure 
Monitor 


Google Fit 
API 


Companion App 


Diabetic Risk 
Yes/Np 


Glucose, BP, Age, Insulin, 


Doctors BMI, and Pregnancies 


Figure 1. Smart remote healthcare framework 


The different types of nodes exist in this framework are as follows: 
a) Detecting node — The main functionality of this node to collect the patient medical information from the 
smart devices. 
b) Visualization node — It combines all the medical information obtained from various sources for making 
it as accessible in the web portal. 
c) Prognostic node — In which, the machine learning classifier is utilized to predict the disease according to 
the subset of features extracted from the patient data. 
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The working flow of the proposed methodology is shown in Figure 2, and its corresponding 
architecture is depicted in Figure 3, which holds the following modules: 
— Data preprocessing and normalization 
— ULMCSO based feature selection 
— PGND classification 

Initially, the data preprocessing and normalization are performed to improve the quality of dataset by 
identifying the missing values, and eliminating the irrelevant information. Because, the disease prediction 
results are highly depending on the attributes of dataset, hence it must be fine-tuned for accurate detection and 
classification. After that, the feature scaling is performed to standardize the values of attributes in the given 
dataset. Consequently, the ULMCSO algorithm is applied to optimally select the features for reducing the 
dimensionality and increasing the accuracy of classifier. Then, the selected features are used to train the 
classifier for predicting the disease of patients with reduced computational complexity and increased detection 
efficiency. Here, the PGND based classification technique is implemented to accurately detect the disease 
based on the features selected from the given dataset. 
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Figure 3. Architecture model of the proposed system 
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3.1. Dataset normalization and preprocessing 

In any case, the dataset preprocessing and normalization processes are performed for tuning the data 
by recognizing the missing characteristics and killing the unessential properties. Besides, the missing 
information substitution and clamor evacuation are considered as the fundamental activities in dataset 
preprocessing, in light of the fact that the commotion free information assists with acquiring a superior 
identification rate. Here, the input dataset is examined initially for analyzing the number of data to identify the 
missing attributes based on the median value. During this cycle, the information is organized in a rising request 
for processing the middle worth. Thus, the irrelevant qualities and missing qualities are supplanted with the 
assessed middle worth. After that, the data normalization is performed by standardizing the values in the range 
of 0 to 1, which helps to evaluate the patterns of disease. For data normalization, the standard deviation is 
estimated by using the regression model as shown in (1) and (2). 


D=py,t+pS+eE, fori=1,2..n (1) 
D; = PotpiS+& (2) 


By using these mathematical models, the residual value is computed. Where, D indicates the pair of 
data, p_0 and p_iare defined as the least square values, S is the input data, and ¢_i denotes the error value. 
The average values are estimated for the sample data by using the standard deviation as shown in (3): 


= = Si (3) 
d 
where, o indicates the standard deviation, S; represents the input data and Fg is the frequency of data. 
Consequently, the data normalization is performed as shown in (4) and (5), 


Dy = (4) 


an 


Si- ; 
Dy =—=—*t (5) 


ai 


where, Dy indicates the normalized data, €; is the residual value, and 6, defines the variance. Then, this 
preprocessed data can be used for the optimization and classification processes. 


3.2. ULMCSO based feature selection 

Close to the completion of preprocessing, the ideal number of components are browsed the dataset by 
using the proposed ULMCSO computation, which helps with diminishing the dimensionality of features. The 
primary motivation behind applying the ULMCSO procedure is to prepare the information model of classifier 
by utilizing the ideal number of highlights. Typically, the feature selection algorithms are mainly in the 
prediction or classification application systems During this cycle, the information is organized in a rising 
request for processing the middle worth. Thus, the irrelevant qualities and missing qualities are supplanted with 
the assessed middle worth. Increasing the overall prediction performance. Here, the optimization is performed 
by incorporating the functionalities of levy flight and CSO mechanisms. The proposed ULMCSO is a kind of 
meta-heuristic optimization technique, which is used to identify the global best fitness function for optimally 
choosing the most reasonable highlights. The principal advantages of utilizing this strategy are as per the 
following: 
— Increased convergence speed 
— Reduced future dimensionality 
— Minimal time consumption for classifier training and testing 
— Increased classification accuracy 

Due to the food hiding ability and memory capacity, the crows are generally considered as an 
intelligent bird that uses its own knowledge for food collection and storage. Also, the CSO streamlining 
approach is widely used in numerous application frameworks for settling the complex multi-objective 
enhancement issues. Because of its wasteful looking through capacity and space, the CSO strategy limits with 
the issue of decreased combination rate. Thus, the toll flight procedure is consolidated with the CSO calculation 
for further developing the general intermingling velocity, and precision. Moreover, the hybrid ULMCSO can 
identify the best fitness value with reduced number of iterations. In this mechanism, the set of populations F, 
are randomly initialized at first with N number of crows. Here, the maximum count for iteration is considered 
as m,, and the position of c’” crow having the dimensional searching space dand iteration t is computed by 
using the (6): 


A remote health monitoring framework for heart disease and diabetes prediction using ... (Adari Ramesh) 


852 Oo ISSN: 2502-4752 
Chee = [Cr Crean Cy (6) 


where, c= 1,2... F, andt= 1,2 ...m,. In this model, each crow has the ability to evoke the location or position 
for hiding the food source, which accomplished before starting the next iteration as estimated as (7). 


At = [hoe hee oo heel (7) 


Moreover, the crow ccan use the random path for safeguarding the food source from the other crow, 
which is represented the random path selection of optimization. In this integrated algorithm, the random path 
is identified by using the levy flight algorithm, where the searching probability is predicted for analyzing the 
behavior of crow. Let, consider that the random number W, is uniformly distributed between the range of 0 to 
1, which is represented as (8). 


W, = Levy~x = a“, Where (1 < A < 3) (8) 
This operation is explained by using the follow model (9), 


Crop + We Ame, * (Beye — CTy.)We = AP 
Random placement Otherwise 


Cret+1 = (9) 


where, Am, defines the amplitude of crow c, Be, denotes the identified best possible solution of crow v, and 
W, is the random number. Finally, the memory vector is updated by using the (10): 


Cret+1 Uf O(Cra4) if better than (Be, +) 


1 
Bect Otherwise oD) 


Crot41 = 


where, O(.) indicates the objective function. By using this value, the best optimal function is identified for 
selecting the features to train the classifier. 

Crow search algorithm | is a new type of swarm intelligence optimization algorithm proposed by 
simulating the crows' intelligent behavior of hiding and retrieving deseased the algorithm has the characteristics 
of simple structure, few control parameters, and easy implementation 


Unified Levy Modeled Crow Search Optimization (ULMCSO) - Algorithm-I 


Input: Preprocessed Dataset; 

Output: Optimal selection of features; 

Step 1: Initialize the set of populations F,, Number of crows N, maximum count for iteration 
m,, dimensional searching space d and iteration t; 

Step 2: The population initialization with n number of crows with t iteration Cr, is 
represented in equ (1); 

Step 3: Then, the location or position of hiding food source by the crow c is estimated by 
using the equ(7); 

Step 4: Generate the random number between the range of [0, 1] by using the levy flight 
modeling as shown in equ (8); 

Step 5: Then, its random placement C7roz4, is updated by using equ (9); 

Step 6: Update the memory function according to the amplitude Am,,, best possible solution 
Be,,, and random number W,. 

Step 7: Based on the objective function O(.), the best optimal solution is computed for 
optimally selecting the number of features; 


3.3. PGND classification 

Subsequent to smoothing out, the classifier is ready by using the best game plan of components that 
helps with chipping away at the accuracy and diminishes the time use of taking care of. In this work, the PGND 
order system is carried out to recognize the illness from the given datasets by utilizing the ideal number of 
chosen highlights. It is a kind of probabilistic mixture model mainly used for labeling the dataset based on the 
Gaussian distribution value. In this algorithm, the model parameters are selected by using the maximum 
likelihood function obtained from the available data. During this process, the sample mean and covariance 
values are computed for predicting the class with increased accuracy. The key benefit of using this After 
streamlining, the classifier is prepared by utilizing the ideal arrangement of elements that assists with working 
on the precision and decreases the time usage of handling. In this work is it effectively avoids the data over 
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training problem with increased robustness. Also, it is more suitable for handling the large dimensional datasets 
with reduced time consumption and error rate. Because, the hyper parameter tuning is performed with zero 
mean, and unit variance measures. Initially, the multi-variate distribution function is computed with T number 
of Gaussian distributions as shown in (11): 


p(opt;|ch; = t) = N(Me;, Coz) (11) 


where, opt; is the optimized dataset, ch, indicates the current operating condition, Me, is the mean value, and 
Co, defines the covariance. Also, there are Tnumber of parameters used to define the feature space model of 
{(Me,, Co,), (Me,, Coz) .... (Me;,, Co,)}. Consequently, the statistical model of mixing properties is estimated 
by using the (12): 


O = {(Me,, Co,, 1), (Me2, Coz, Tp) ... (Me, Coy, F;)} (12) 


where, I’, indicates the mixing proportions of class t € S, T = {T,,T, ... 1}. Then, the normal inverse wischart 
distribution NIW is computed based on the conjugate of distribution as shown in (13). 


p(Me,, Co,) = NIW (mko, 1%, do, MVo) (13) 


Where, mk, is the prior mean of Me,, MV, is the prior mean of Co;, 7) and qo indicates the strength 
of prior. Consequently, the identity matrix is constructed [ID x ID] with the ddimensional vector, and the 
distribution over the labelled space is denoted by using the (14). 


pl) = Dir(w)o TT, Pe" (14) 


Where, the hyper parameters w = {wy, ...@} that is used to integrate the posterior probability of each 
class. Then, its equally weighted factor w, = ave is computed for each class, and the generative statistical 


model p(ch;, opt;,©) is also determined. Consequently, the labelled data LDg is used to generate the initial 
number of classes T, where the model parameters are estimated by using the Bayesian function. Moreover, the 
posteriori probability is estimated and updated with the model parameters as shown in (15). 


p(Me,, Co,|ch; = t, LDg) = NIW(mky, %, In MV,,) (15) 


The parameters mk, 1, Gg, MVp are computed as following (16)-(19): 


mkn = mere mk + aes Opt; (16) 
Th =%+Nch (17) 
Qn = 40 +N (18) 
MV,, = MV, + MV + rpmkymk§ — r,mk,mks (19) 


where, n; indicates the number of observations in the labelled data LDz, and opt, defines the sample mean 
with the label t. Then, the sum of square matrix computed for each class t is shown in (20). 


MV = ¥." opt,opt? (20) 


t=1 


The posterior probability is estimated based on the categorical distribution of the dirichlet function as 
shown in (21), 


ACL © (21) 


according to the posteriori distribution function, the Bayes rule is applied to predict the classified label from 
the given data as shown in (22). 


p(Opt;|chy=t,LDq)p(chy=t|LDq) 
p(opt;|LDq) 


p(ch; = t|opt;,LDq) = (22) 


By utilizing this model, the characterized mark is anticipated as whether typical or illness impacted. 
The significant advantages of utilizing this instrument are as per the following: 
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— Demands least measure of investment for preparing and testing the models. 
— Limited computational intricacy. 

— Expanded discovery proficiency and exactness. 

— Guaranteed unwavering quality and versatility. 


4. RESULTS AND DISCUSSION 

This section validates the results of both existing and proposed remote disease detection 
methodologies by using various evaluation metrics, where the MATLAB/simulation tool is utilized to obtain 
the results. For this analysis, the diabetics and heart disease detection datasets are used, which includes 
Hungarian and PIMA (Indian Diabetes Database). The Hungarian dataset comprises 76 attributes, in which 
some of the essential attributes are listed in Table 2. Figure 4 which has Figure 4(a) and Figure 4(b) shows the 
confusion matrix of the heart disease dataset with respect to the classes of 0 and 1. Based on the results, it is 
analyzed that the proposed mechanism accurately predicts the classes with increased TPR. Similarly, the PIMA 
dataset is obtained from the machine learning repository, which holds 768 number of samples with 9 different 
feature attributes. Also, it comprises the patient information aging from 21 to 81, and its attribute information 
are listed in Table 3. 


Table 2. Hungarian dataset description 


Description 
Age Young, medium, old or very old 
Gender Male and female 
Maximum Heart Rate (MHR) Low, medium, high or very high 
Chest Pain (CPT) Type 1- typical 


Type 2 — Atypical 
Type 3 — Non-anginal 
Type 4 — Asymptomatic 


Resting Blood Pressure (RBP) Low, medium, high or very high 
Blood sugar (FBS) BS > 120 mg/dl 
Serum Cholesterol (SCH) Low, medium, high or very high 
Major vessels (VCA) Major vessels (0 to 3) 
Thallium scan (TCA) Normal, fixed defect, and reversible defect 
Exercise Induced Angina (EIA) Yes—1 
No-0 


Depression Induced by Exercise (OPK) Low, Risk or Terrible 


Class0 Class 1 Class0O Class1 
Class 0 Class 0 [}:)1)) 180 
Class 1 Class 1 350 1700 0.93 


0.95 0.96 


(a) (b) 


Figure 4. Confusion matrix for Hungarian dataset (a) training and (b) testing 


Table 3. PIMA dataset description 


Attributes Description 
No of pregnancies Numerical attribute 
Blood Pressure (BP) mmHg 
Age Young, middle, old or very old (in terms of years) 
BMI Body mass index 
Insulin mU/mL 
Glucose level Estimated in terms of mg/dL 
Skin type Mm 
Diabetes Pedigree Yes—1 
No-0 
No of pregnancies Numerical attribute 
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Figure 5 and Table 4 looks at the current [33] and proposed arrangement procedures in view of the 
proportions of accuracy, review, precision, and f-measure. These parameters are extensively used in many 
detection/classification application systems for analyzing the efficiency of methodologies. Also, the overall 
performance of classifier is highly depending on the improved values of these measures, which are calculated 
as follows (21)-(24): 


TP+TN 
ACCUr ACY = Fo tN+RP+FN Gl) 
Precision = —~ (22) 
TP +FP 
Recall or TPR = —— (23) 
TP +FN 
F — Measure = —— (24) 
2TP +FP+FN 


where, TP — true positives, TN — true negatives, FP — false positives, and FN — false negatives. Based on the 
estimatedresults, it is analyzed that the proposed ULMCSO-PGND outperforms the other approaches with 
increased precision, recall, accuracy, and f-measure values. 


100 


= 80 
: = 

4 
a 
tw 20 
» 0 

wd eS 454 ; 

: at 35 F oS os rs 
€ a 
= : 
-¥ 


@Precision Recall BAccuracy F-Measure 


Figure 5. Similar investigation among existing and proposed procedures utilizing Hungarian dataset 


Table 4. Accuracy, sensitivity, and specificity analysis using the Hungarian dataset 


Methods Precision Recall Accuracy F-Measure 
SVM 87.5 81.5 84.4 84.5 
LR 89.2 95.2 92.2 92.2 
MLP 93.3 85.3 89.3 89.3 
RF 87.4 87.4 87.3 87.4 
DT 84.6 777 77.6 77.6 
NB 88.8 78.5 83.4 83.4 
Ensemble DL 98.2 96.4 98.5 97.2 
Proposed 98 98.5 98.5 98 


Figure 6 and Table 5 thinks about the root mean squared blunder (RMSE) and mean absolute error 
(MAE) worth of both existing and proposed arrangement strategies utilizing the Hungarian dataset. These are 
the error measures calculated as shown in (25), (26): 


RMSE = [pate _ 2,2 (25) 


MAE = ~YN, |x, -— &| (26) 


here N is the total number of observations, x; is the actual value, and %; denotes the predicted value. From the 
obtained results, it is evident that the proposed ULMCSO-PGND technique provides the reduced error outputs, 
when compared to the other techniques. Because, the normalization and feature selection processes could 
efficiently reduce the error values by perfectly training the classifier with the optimal features. 
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Figure 6. RMSE and MAE 


Table 5. RMSE and MAE using the Hungarian dataset 
Methods RMSE _ MAE 


SVM 0.39 0.15 
LR 0.22 0.11 
MLP 0.27 0.10 
RF 0.34 0.13 
DT 0.39 0.31 
NB 0.38 0.36 


Ensemble DL 0.21 0.12 
Proposed 0.13 0.9 


Figure 7 and Table 6 compares the overall accuracy of both existing and proposed classification 
techniques by using the Hungarian dataset. Typically, the overall accuracy is estimated for analyzing the total 
efficiency and improved performance of the detection mechanism. According to this evaluation, it is observed 
that the proposed ULMCSO-PGND technique outperforms the other approaches with increased accuracy. Due 
to the optimal selection of feature, the training model of classifier is improved, which helps to obtain an 
increased accuracy value. 


100 Table 6. Overall accuracy analysis using 
S go ail i Hungarian dataset 

>, 60 Methods Overall accuracy 
& il q GFLS 78.7 
= 40 it 1 il | Ensemble classifier 85.4 
4 20 fi i) i i Type 2 - FL 86 
0 . ; Hybrid ML 88.4 
n os Vw hm Pens s DL 89 
Berea Zed S56 ss ANN 1 
Cegankz Sk a o & Kernal RF 91 
gas zs Sa MLP 92 
ee m4 Fuzzy Diagnosis system 92.3 
DT 92.8 
: awe ANFIS 94.1 
Figure 7. Overall accuracy of existing and proposed Ensemble DL 98.5 
classification techniques Proposed 99 


Figure 8 and Table 7 glances at the current [34] and proposed portrayal techniques considering the 
accuracy of disclosures. Where the PIMA dataset is considered for examination. The acquired outcomes 
portray that the proposed ULMCSO-PSND strategy gives an expanded exactness, when contrasted with 
different procedures. 

Figure 9 and Table 8 looks at the current [35] and proposed arrangement methods in view of the 
proportions of exactness, accuracy, and review utilizing the PIMA dataset. In the proposed system, a cross 
breed ULMCSO calculation finds the best ideal arrangement with diminished number of emphases and 
expanded combination rate, which assists with choosing the most fit set of highlights. Then, these features are 
used to train the classifier that improves the entire performance of system with increased accuracy, precision, 
and recall values. 
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Table 7. Comparative analysis based on accuracy 


ay using PIMA dataset 
iS 40 Methods Accuracy (%) 
a 20 Firefly with cuckoo optimization- 81 
= 0 based classification 
8 PS DSSS S$ PD Feed forward NN 82 
“ CP OSS PSs * NB 79.56 
ee LDA with MWSVM 89.74 
S vS> GA with NN 87.46 
s K-means with DT 90 
K-means with PCA HD 


ULMCSO-PGND 98.5 
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Figure 9. Overall comparative analysis using PIMA dataset 


Table 8. Accuracy, precision, and recall analysis using PIMA dataset 


Methods Accuracy (%) _ Precision(%) _ Recall (%) 
LR 73.05 72.7 13 
RF TIA 75.7 76.9 
Fine-tuned MLP 86.08 85.1 86.6 
Proposed 98.5 99 99 


5. CONCLUSION 

This paper presented a new remote health monitoring framework using an advanced AI method for 
predicting the different types of disease. The main contribution of this paper is to remotely monitor the health 
status of patients for disease identification, and proper diagnosis. In this framework, the one-time authentication 
is performed for fusing the medical information obtained from the sensors. Then, the mobile application can 
directly extract the required data from the cloud platforms, hence the user does not require to enter their data, 
which is one of the key benefits of this system. For this purpose, the Google Fit cloud service is utilized in this 
model that maintains the medical information of patients obtained through smart devices. Moreover, the 
different types of nodes such asdetecting node, visualization node, and prognostic node are used to incorporate 
the medical information obtained from the different sources. During sickness expectation, the givenclinical 
datasets are preprocessed and standardized at the underlying state for working on the exhibition of classifier. 
Then, at that point, the ULMCSO computation is applied to pick the components for decreasing the 
dimensionality and extending the accuracy of classifier preferably. After enhancement, the PGND 
characterization component is executed to identify the sickness from the given datasets by utilizing the ideal 
number of chosen highlights. For approving the outcomes, two famous datasets, for example, PIMA and 
Hungarian are utilized to assess the presentation of proposed model. Additionally, the acquired outcomes are 
contrasted and the new best in class approaches as far as exactness, accuracy, review, f-measure, and mistake 
rate. From the evaluation, it is analyzed the proposed outperforms the other techniques with improved 
performance values. 
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